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ABSTRACT 



Context. The imprint of Baryonic Acoustic Oscillations (BAO) on the matter power spectrum can be constrained using the neutral 
hydrogen density in the intergalactic medium (IGM) as a tracer of the matter density. One of the goals of the Baryon Oscillation 
Spectroscopic Survey (BOSS) of the Sloan Digital Sky Survey (SDSS-III) is to derive the Hubble expansion rate and the angular scale 
from the BAO signal in the IGM. To this aim, the Lyman-o- forest of 10^ quasars will be observed in the redshift range 2.2 < z < 3.5 
and over ~ 10, 000 deg^. 

Aims. We simulated the BOSS QSO survey to estimate the statistical accuracy on the BAO scale determination provided by such a 
large scale survey. In particular, we discuss the effect of the poorly constrained estimate of the unabsorbed intrinsic quasar spectrum. 
Methods. The volume of current N-hoAy simulations being too small for such studies, we resorted to Gaussian random field (GRF) 
simulations. We validated the use of GRFs by comparing the output of GRF simulations with that of the HoRizoN-4n A'-body dark- 
matter-only simulation with the same initial conditions. Realistic mock samples of QSO Lyman-o' forest were generated; the3ir power 
spectrum was computed and fitted to obtain the BAO scale. The rms of the results for 100 different simulations provides an estimate 
of the statistical error expected from the BOSS survey. 

Results. We confirm the results from Fisher matrix estimate. In the absence of error on the unabsorbed quasar spectrum, the BOSS 
quasar survey should measure the BAO scale with an error of the order of 2.3%, or the transverse and radial BAO scales separately 
with errors of the order of 6.8% and 3.9%, respectively. The significance of the BAO detection is assessed by an average A^' = 17 
but for individual realizations Ax' ranges from 2 t o 35. The error on the unabsorbed quasar spectrum increases the error on the BAO 
scale by 10 to 20% and results in a sub percent bias. 

Key words, cosmology dark energy - cosmology: LSS - Galaxies: IGM - Galaxies: quasars (absorption lines) - Method: numerical 



1. Introduction 

Constraining the properties of dark energy that drives the ex- 
pansion of the Universe is key towards understanding cos- 
mology. Baryonic acoustic oscillations (BAO) in the baryon- 
photon fluid of the pre-recombination Universe imprint the 
sound horizon distance at decoupling as a typic al scale in the 
matter correlation function or powe r spec trum (Peebles & Yu! 
1970t ISunvaev & ZeldovichI 119701: Eisenstein & Hu. .1998t 
Bashinskv & Bertschin^eil l2002h . These oscillation s were de- 



' tected both in the cosmic microwave background (e.g. 'Page et al.' 
^2003) a nd in the spatia l distribution of galaxies at lo w redshift 

■ dEisenstein et all 120051: ICole et alj l2005l iPercival et all |2009() . 

' Their measurements give important and cohere nt constraints on 
cosmological parameters jKomatsu et al.ll2009h . 

More recently, it was realized that BAOs could be detected 
in the Ly-a forest (McDonald 2003; White 2003) used as a 
probe of the intergalactic medium (IGM) at intermediate red- 
shifts (z ~ 2 - 3) and the potential of the measurement was 
quantified bv lMcDonald & Eisensteini (l2007h . The structure and 
composition of the IGM has long been studied using the Ly- 
a forest in QSO absorption spectra (Rauch 1998). The advent 
of high spectral resolution Echelle-spectrographs on 10 m-class 
telescopes has led to a consistent picture in which the absorp- 
tion features are related to the distribution of neutral hydro- 



gen (Hi) through the Hi Lyman transition lines. The IGM is 
believed to contain the majority of baryons in the Unive rse at 
these redshifts (Petitie an et al.lll993b Ipukugita et al.lll998h . and 
is highly ioniz ed by the UV -background produced by galax- 
ies and QSOs ( Gunn & Pet erson 1965), at least since z ~ 6 
(iFan et al . 2006: Becker et al. 2007). Photo-ionization equilib- 
rium in the expanding IGM establishes a tight correlation be- 
tween neutral and total hydrogen densities. Numerical simula- 
tions and analytical models support the existence of this cor- 
relation and show that the gas density traces the fluctuations 
of the DM density on scales larger than t he Jeans length (see 
forexamplefei et alJl992HCen et alJl994tlPetitiean et al.lll995l: 



iMiralda-Escude et aljri996l;lTheuns et al.iri998h . 
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In this paradigm, the IGM consists of mildly non-linear 
gas that traces the dark matter, and is photo-heated by the 
UV-background. Although metals are detected in the IGM 
(ICowie et alJll995l: ISchave et al.ll2003t lAracil et alJl2004l) . stir- 
ring of the IGM due to feedback from galaxies and AGNs proba- 
bly does not s trongly aff ect the vast majority of the baryons (e.g. 
iTheunsetalJ i2002b: M cDonald et al J i2005h . The relation be- 
tween the Ly-Q- forest flux and the underlying matter field is non- 
linear since fluctuations are compressed to the range < F < 1 . 
However, unlike galaxy surveys which sample only peaks in the 
matter field, the whole space along the quasar line-of-sight is 
democratically sampled. This is expected to lead to less scale- 
dependence in the bias compared to that observed in galaxy sur- 
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veys. The shapes and clustering of lines ha ve been extensively 
used to infer the temperature of the IGM ('Schave et al."l999'; 
[Ricotti et al. 2000; Theuns et al. 2000; McDonald et al. 2001) 
determine the amplitude of the UV-background (Rauc h et al.l 
119971; iBolton et al.l |2005[). trace the density s tructures around 
galax i es and quasars dRolUnde et all 120051; iGuimaraes et al.l 
l2007t iKim & Cro ft' 7008), constrain the reionization history 
of the Univer se (Theuns et al. 2002a; Hui & H aiman 2003; 
Fan et al.l2006l). measur e the matter power spectru m dCroft et al.l 



1999t IVieletaH l2004t iMcDonald et "aD 12006') or co nstrain 
cosmological parame t ers ([McDonald & Miralda-Escud g 1 1 9991; 
Rollinde et al. 2003", Coppolani et al. 2006; Guimaraes et al. 
2007; Viel & Haehnelt 2006) . Padmanabhan & White (2009) 
(see also Meiksin et al.l ll999h analyzed the amplitude of non- 
linear effects by comparing perturbative theory with outputs of 
ten dark matter numerical simulations. Although they focused 
on halos only, they demonstrated that the shift of the recon- 
structed BAO scale due to non-linearities decreases with redshift 
as D^(z), the square of the Unear growth factor. The simple linear 
bias between flux and matter power spectrum was also predicted 
by McDonald (2003) at low Ar-values, and by Slosar et al. (2009) 
and lWhiteet all (12009) at BAO scales. 

The observation of BAOs in the Ly-a forest requires a full 
3-dimensional sampling of the matter density, and therefore a 
much higher number density and number of quasars than pre- 
viously available. The Baryon Oscillation Spectroscopic Survey 
(BOSS) (Schlegel et al .ll2009l) of the S loan Digital Sky Survey- 
Ill (SDSS-III) (iEisensteinetal.ll201lf) aims to identify and ob- 
serve more than 150,000 QSOs over 10,000 square degrees. The 
QSO redshift range useful for BAO reconstruction is limited to 
z > 2.15 on the low side by the requirement that the Ly-a ab- 
sorption falls in BOSS spectrograph wavelength range. It is lim- 
ited to about z < 3.5 on the high side by the sharp decrease 
of the QSO density both intrinsically a nd for the magnitude ac- 
cessib le to the spectrograph (g < 22). iMcDonald & EisensteinI 
(|2007|) estimated, from standard Fisher matrix techniques and 
analytical description of the Ly-a power spectrum, that the den- 
sity of quasars should be of the order of 20-30 per square de- 
gree to achieve constraints of the order of 1% in the radial 
and transverse BAO scales, see also lMcOuinn & White! (l201 ih . 
Such a high requirement lead to new deve l opm ents on target 
selection (Palanque-Delabrouil le et al. 20101; Yeche et aL 120101; 
iBovvetal .1120 lit iKirkpatric k et al.ll201 iLlRoss et al.ll201 iL 

It is important to confirm the predictions on the BAO scale 
measure ments with additiona l work on numerica l simulations. 
Recently, ISlosar et all (l2009l) ; IWhite et al.1 (l2009h have studied 
the BAO signature in a typical Ly-a forest survey, using large A^- 
body simulations, but with still higher quasar density. This paper 
follows those works and investigates errors in the BAO scale es- 
timates as a function of the properties of the survey such as the 
density o f quasars and the amplitude of the noise. Very recently 
iGreig et al. (2011) have published a similar study but assuming 
all QSO located at the same z, with constant S /N ratio and a 
much smaller volume than the BOSS survey (corresponding to 
79 instead of 10,000 deg ^). In Sect. |2] a comparison with the 



^ 



sorbed spectrum is discussed. The resulting cosmological con- 
straints are presented in Sect. |5] and we draw conclusions in 
Sect.m 



2. Description of the simulations 

2.1. From Nbody to Gaussian random field (GRF) 
simulations 

The size of large N-body simulations is typically (2 Gpc/h)^. At 
z = 2.5 this corresponds to 745 deg^ which is much smaller than 
the 10,000 deg2 of the BOSS survey. As wifl be clear in Sect.O 
probing such a volume with the Ly-a forest of 20 quasars per 
deg^ results in a power spectrum or a correlation function where 
the BAO features are hardly seen: some realizations will exhibit 
them and some will not. A larger volume is therefore required in 
order to study the error on the reconstructed BAO scale. 

Such a volume can be provided by Gaussian random field 
simulations, which however do not contain any non-linear ef- 
fects. The HoRizoN-4n simulation was used to investigate the rel- 
evance of these effects for the study of BAO scale reconstruction. 
HoRizoN-4n is a ACDM dark-matter-only simulation based on 
cosmological parameters inferred by the WMAP three-year re- 
sults, with a box size of 2/1" 'Gpc on a grid of size 4096^. The 
purpose of this simulation is to investigate full sky weak lensing 
and baryonic acoustic oscillations. The 70 billion particles were 
evolved using the Particle Mesh scheme of the RAMSES code 
on an adaptively refined grid (AMR) with about 140 billion cells. 
Each of the 70 billions cells of the base grid was recursively re- 
fined up to 6 additional levels of refinement, reaching a formal 
resolution of 262,144 cells in eac h direction (rough ly 7 kpc/h 
comoving). The code mpgrafic (iPrunet et al.l 120081) was used 
to generate the initial conditions (ICs). 

For identical initial conditions, we compared outputs from 
HoRizoN-4n simulation with the linear density modified through 
a lognormal model to incorporate some of the non-linearities. 
To make this comparison statistically as powerful as possible we 
did not implement here all the complications of a realistic sur- 
vey. Transmitted-flux-fraction spectra were generated without 
any observational noise and extended in wavelength all along the 
box, instead of being limited to the Ly-a forest. In addition the x 
and y positions of those lines were chosen regularly which elim- 
inates the sampling noise (see Sect. 12. 21 and Eq.[3]). In this case, 
we can divide the simulation into eight individual boxes with 
the same volume and still see the BAO features in each box. The 
EFT of each box was computed to produce the power spectrum, 
which was Fourier transformed to give the correlation function. 
The correlation function was fitted to determine the BAO peak 
position. This was done both for the full HoRizoN-4n and for the 
lognormal densities, yielding kA - 0.05766 ± 0.00197 h/Mpc 
(LN) and kA = 0.05820 + 0.00192 h/Mpc (Horizon), where the 
errors are obtained from the rms of the 8 values. The two sets 
of values are correlated and the difference is 0.00053 + 0.00166, 
i.e. (0.9 ± 2.9)%. We conclude that we do not observe non-linear 
effect on the reconstruction of the BAO scale at this level of ac- 



HoRizoN-4n(I Prunet et al.ll2008h dark-matter-only A^-body simu- curacy, 
lation validates the use of linear Gaussian random field (GRF) 
to study BAO scale reconstruction. The production of realistic 
mock spectra is described, including quasar unabsorbed spec- 
trum, noise, and the effect of peculiar velocities. Standard meth- 
ods to analyze BAO signal in terms of power spectrum are pre- 
sented in Section [3] The performance of the survey for different 
quasar densities and noise amplitudes are presented in Sect. |4] 
and the effect of the error on the estimate of the quasar unab- 



2.2. Gaussian random field simulations 

Complex normal Gaussian fields were generated in a box in 
Fourier space with 6(-k) = 6*(k). The amplitude of each mode 
was multipli ed by the square root of the power spectrum P(k) at 
z = from lEisenstein&Hul (Il998h with //q = 71 km/s/Mpc, 
Q.,„ = 0.27, Qi = 0.044, Qa = 0.73 and w = -1. An inverse 
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Fig. 1. One-dimension power spectrum (P'^) for our simulation 
with chr = (lower, blue histogram), chr - 1 (magenta) and 
Chr = 1-2 (upper, red), where chr is defined in Eq.|2] compared 
to expectation from Eq.[T] (black continuous line). 



FFT provided a linear simulation of matter density fluctuations, 
6p/p, at z = in a box of 2560 x 2560 x 512 pixels of (3.2 
Mpc/h)^, i.e. 8179 Mpc/h in the transverse directions and 1636 
Mpc/h in the longitudinal direction. The transverse size covers 
12,500 deg2 at z = 2.5 and 9100 deg^ at z = 3.5, while the longi- 
tudinal size covers from z=1.75 (corresponding to A = 3344 A 
for Ly-a absorption) to z = 3.9. Peculiar velocities in the direc- 
tion parallel to the line of sights were computed from densities 

ikn H(z) dD 
in Fourier space using Vk,, - --rr -t^tt -^ (i>^k where D(z) is the 

k-^ D(z) oz 
linear growth factor 

The clustering of quasars was neglected and their angular 
positions were randomly drawn within the box. They were as- 
signed a redshift and a magnitude according to the distribution 
oflJiang et al. (2006). The lines of sight of the quasars were taken 
to be parallel. Real data will have to be analyzed taking into ac- 
count the angle between the lines of sight, but if we consistently 
produce the simulation and analyze the resulting spectra with 
parallel lines of sight, this should have a negligible eff'ect on the 
statistical error on the reconstructed BAO scale. 

The matter density, in pixels located along the line of sight 
of each quasar, was evolved to the redshift of the pixel by multi- 
plying Sp/p by D{z) at the redshift of the considered pixel. This 
provided us with density spectra in 3.2 Mpc/h bins. We did not 
interpolate to regular bins, e.g. in log{A), and then interpolate 
back to regular bins in Mpc to compute the power spectrum. 
Since we anyway average the flux over (12.8 Mpc//i)^ pixels be- 
fore applying a FFT to compute the power spectrum (Sect. 13.11 ). 
we believe this is a minor effect. 

Because QSOs sample only a very small transverse region, 
such a simulation misses a significant contribution to the power 
spectrum coming from transverse scales smaller than the 3.2 
Mpc/h pixel size. This is very clear for the ID power spec- 
trum, which i s an integral of the 3D power spectrum over kj_ 
dKaiser & Peacock. 1991.) and therefore includes large kj_ : 



P'"(kii) 



1 r°° 

= — p(k\\,k^)k^dk^ 

2-T Jo 



(1) 



This missing small scale contribution is illustrated in Fig. [T] 
where the ID power spectrum of the matter density for the sim- 
ulation with 3.2 Mpc//i pixels (lower blue histogram, Chr = 0) 



appears significantly lower than the expectation from Eq. \T\ 
(black continuous curve). In order to compensate for the miss- 
ing contribution, we have generated 20 high-resolution (HR) 
Gaussian random field simulations with a volume correspond- 
ing to one pixel of our large-volume low-resolution simulations 
(LR), i.e. (3.2 Mpc/h)^, and a pixel size of 200 kpc. When sim- 
ulating the matter density along a quasar line-of-sight, for each 
large pixel we randomly selected one of the HR simulations, and 
we defined the density in the small pixels as 



6p = (Sp)ui + Chr (Sp)hr ■ 



(2) 



As illustrated in Fig.[T] if we just add the LR and HR simulations 
(i.e. Chr = 1 in Eq. ^ the ID power spectrum of the matter 
density is still slightly smaller than predicted by Eq. [TPl . This is 
not surprising since we do not have any correlation between the 
HR simulations in neighboring large pixels. Using an effective 
correction factor chr = 1.2 results in a P^'^ik) which fits well 
Eq.[T] at least in the k\\ range relevant for BAO, see Fig.[T] 

McDonald & Eisenstein (2007) showed that the observed 
power spectrum, Pobs(^), is the sum of the true power spectrum, 
a sampling contribution and a noise contribution : 



,{k) = P{k) + P^^P'^ikiO + Pn, 



(3) 



where, in the absence of pixel weighting, P^ is the inverse of 
the surface density of quasars (in Mpc"^). Eq. [3] indicates that 
the accurate description achieved for P'^ at k\\ < 0.3, ensures a 
good description of Pobs(^) in the range relevant for BAO. 

The next step was to go from the matter density fluctuations, 
to QSO transmitted flux fractions F - exp(-T), where t is the 
optical depth. Note that F is the traditional notation for the trans- 
mitted flux fraction and we will use (p for the QSO flux. We used 
the relation 



F - exp 



6p 

-a{z)expb — 

P 



(4) 



This means the lognormal approa ch was used to get th e baryon 
density from our Gaussian fields dBi & Davidsen|[l997h and the 
baryon density was transformed into transmitted flux fractions F 
using the fluctuating Gunn -Peterson approximation dCroft et al.l 
IT99i:lGnedin & Hua[T998h . 

We followed the procedure of lMcDonaldl (l2003h to take into 
account the effect of the peculiar velocities: we accounted for the 
expansion or contraction of cells by translating each cell edge in 
real space into redshift space using the average velocity of the 
two cells that the edge separates. The optical depth contributed 
by each real-space cell was then distributed to multiple redshift- 
space pixels based on its fractional overlap with each. 

The value ofb in Eq.|4]was fixed to b = 2-0.7(y- 1) - 1.5 8 
for an equation of state parameter 7=1.6 (iHui & Gnedinll997h . 
The value of a(z) was fitted to reproduce the experimental ID 
power spectrum and the resulting mean transmitted flux fraction 
F(z) was checked to be in good agreement with the data, as il- 
lustrated by Fig. |2l We could alternatively have fitted F(z) and 
checked P'^. More precisely, McDonald et al. (2006) measured 
P'^ for Aril between ^ 0.14 and ^ 1.8/z/Mpc and 2.2 < z < 4.2, 
and in each bin in z we fixed a(z) to fit the first four bins in k, 
from =sO. 14 to a;0.28 h/Mpc. We did not fit higher k bins because 
our simulations do not include non-linear effects and are not ex- 
pected to fit data at high k. Note that it would have been more 



' Note that in Eq. [T]we integrated up to k± = n/(lOO kpc//?) which 
corresponds to a typical value of the Jeans' scale. This however is only 
a reduction of a few % relative to the integral up to infinity. 
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natural to use 100 kpc/h pixel size for the HR simulations, a typ- 
ical value of the Jeans scale. In this case one needs a correction 
factor chr-1-12 only, but one cannot simultaneously reproduce 
F{z) and P'°. 

Peculiar velocities introduce a dependence on ju = k\\/k in 
the redshift-space power spectrum : on large scales we have 
P{k,fj) = (1 +Piu ^y^Po(k), whe re P[)ik) is the isotropic real-space 
power spectrum (lKaiserlll987h . For galaxy surveys, /3 is related 
to the bias and the growth rate of structur e, but for Ly- g forest 
it is an independent parameter (McDonal d et al.ir2000h . Fig. [3] 
shows the ratio of the redshift-space over the real-space power 
spectra for our simulatiorQ. This ratio follows Kaiser formula in 
the k range relevant for BAO, k < 0.2. The departure at higher 
k is due to the fact that our procedure to implement the effect of 
velocities is only valid for scales larger than a few (3.2Mpc//!) 
pixels. The ratio of power spectra is unity in the transverse di- 
rection (// = 0) and about 5 in the longitudinal direction (/i = 1), 
which corresponds to /? - 1.2 in Kai ser formula. This is to 
be compared to (3 - 1.58 according to 'M cDonald! (!2003) sim- 
ulations and 0.38 < (3 < 1.05, as measured witii first BOSS 
data (ISlosar et al.ll201l[) . Note, however, that the value of y6 ob- 
tained by BOSS is contaminated by the presence of damped Ly- 
a systems and metal lines in the quasar spectra. We also observe 
a bias b -Q).\9 relative to the matter power spectrum, to be com- 
pai-ed to 0.17 <b < 0.25 measured with BOSS data. 

To get the flux, (pi{A), of quasar /, the transmitted flux frac- 
tion, Fi{A), must be multiplied by the quasar unabsorbed spec- 
trum, i.e. the quasar spectrum, including the QSO emission lines, 
if there wer e no absorption. Th e principal component analy- 
sis (PCA) of Suzuki et al.| (12005') was used to generate for each 
mock spectrum a random PCA unabsorbed spectrum, which was 
normalized according to the g-band magnitude of the quasar 
Noise was added according to the characteristics of BOSS spec- 
trograph, including readout noise, sky noise and signal noise and 
assuming four exposures of 900 s, for each QSO spectrum. Fig.|4] 
presents the mean signal-to-noise ratio per lA bin in the Ly-c 
forest, which varies from 14 for a quasar magnitude nig - 19 



^ Note that the power spectra were obtained without noise and using 
all pixels of the box, not just those along some random QSO lines. This 
removes the contribution from the noise and sampling terms so that 
PdbA1<) = Pfik), see Eq.[3] 
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Fig. 4. Mean signal-to-noise ratio in the Ly-o; forest for 1 A bins, 
as a function of the quasar magnitude in the g band. 



to 1.6 for nig - 22. Fig. |5] shows an example of such a mock 
spectrum. The pdf of the transmitted flux fraction is presented 
in Fig.|6] For high resolution bins this pdf exhibits peaks at zero 
and unity but for low resolutions bins, the flux is averaged and 
there is a single peak around F. In addition, due to noise the 
transmitted flux fraction can be larger than unity and also nega- 
tive. 



3. From spectra to BAO signal 

The mock spectra, produced as described above, were analyzed 
to reconstruct the power spectrum and the BAO scale. 



3.1. Power spectrum estimator 

The flux (pi in mock spectrum / must be divided by the product of 
F(z) and the quasar unabsorbed spectrum c, in order to provide 
the fluctuation of the transmitted flux fraction, df(A), as 



Sf(A) 



F(z)ci(A) 



1. 



(5) 



This requires estimates of F(z) and c,(/l). In Sect. 14.21 we will 
discuss various methods of doing this. In this section we use the 
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Fig. 5. Mock spectrum of a quasar with redshift z - 3.02 and 
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true values of F(z) and c,, a procedure which, as we will see, 
only slightly overestimates the precision of the determination of 
the BAO signal. A grid with (12.8 Mpc//;)^ cells was filled with 
the part of the bj; spectra which corresponds to the Ly-a for- 
est (we selected 1041 < X < 1181A, in the quasar rest frame). 
An unweighted average of bj; was calculated for all pixels of all 
Ly-a forest spectra lying in a considered grid cell. Some grid 
cells do not contain any Ly-c forest spectra. The average of all 
other cells was computed and this average was subtracted from 
the content of all filled cells, while the content of unfilled cells 
remained zero. This was done in order to avoid including the 
Fourier transform of the quasar spatial distribution in the power 
spectrum. This is analogous to subtracting a sy nthetic catalog 
in the case of galaxy surveys, as advocated by iFeldman et alj 
(11994 . 

A Fourier transform of the resulting grid was performed and 
the modulus squared computed for each mode (A:,, k^, k,), which 
gives the power for the considered mode. The angular-averaged 
power spectrum was then obtained as the average of the power 
for all the modes with \k\ in a given bin. Note, however, that 
we did not include purely radial (0, 0, k^) and purely transverse 
(kx, ky, 0) modes. The former correspond to Fourier transforms 
along quasar lines of sight. These are severely affected by the 
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Fig. 7. Power spectrum in the simulation (red upper histogram) 
compared to the expectation of Eq. |3](blue, just below red). The 
latter is the sum of the input power spectrum (black, decreasing 
quickly with k), the noise contribution (black, constant with k) 
and the sampling contribution (green, decreasing slowly with k). 



uncertainty on the quasar unabsorbed spectrum estimate. The 
latter have a very large sampling term contribution due to the 
large value of P^^{k\\ - 0). Purely transverse modes are 5.8% 
of all modes at the first BAO peak and 3.4% at the second BAO 
peak, while the number of purely longitudinal modes is less than 
0.01% of all modes at the first BAO peak. The error on P{k) is 
obtained as the rms of the power in all modes in the considered 
k bin, divided by the square root of the number of modes. The 
correlations between the power in diflferent k bins are expected 
to be small and we neglect them. 

The resulting power spectrum exceeds Eq.[3]by about 10%, 
as illustrated in Fig|7] We see that the sampling and noise contri- 
butions are of similar sizes and much larger than the input (LSS) 
power spectrum, which means that the BAO oscillations will be 
considerably diluted, as can be seen in Fig. [8] 



3.2. Reconstruction of the BAO scale 

To infer the BAO scale, the angular-averaged power spectrum 
was first fitted by a polynomial. The power spectrum divided by 
this polynomial was then fitted with 



(6) 



1 +A<.k exp — 



U-kV] 




k 




sin 


In— 


[\T J \ 




kA 



as suggested bv lBlake & Glazebrookl (l2003 h. This provides the 
(isotropic) BAO scale kA, as illustrated in Fig. [8] This figure cor- 
responds to an average realization with Ax^ - 16.8 and the BAO 
oscillations are quite visible. Their amplitude is less than a per- 
cent, which means they are considerably diluted relative to the 
input power spectrum where they are on the order of 10%. This 
is due to the sampling and noise contributions to the observed 
power spectrum, see Eq. [3] 

The upper limit of the polynomial fit range was set to 0.24, 
due to our 12.8 Mpc/h EFT cells which result in a maximum k 
of 0.245 /i/Mpc in x, y or z direction. The lower limit and the 
order of the polynomial were set so as to get a good x^ to the 
power spectrum obtained for some additional simulations which 
did not include the BAO features. These simulations were gen- 
erat ed using the power spec trum with baryons but without BAO 
by lEisenstein & Hul (Il998l) . We noted that using a polynomial 
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Fig. 8. Power spectrum for a realization of the nominal setup 
(15 QSO/deg^) div ided by a 7''' order polynomial and fitted with 
iBlake & Glazebroo k (2003) formula, see text. 



of too low a degree could result in a significant bias in the re- 
constructed BAO scale, while on the other hand increasing the 
polynomial degree beyond what is needed to get a good fit could 
degrade the performance (i.e. increase the rms of A:a) because the 
polynomial fit starts to fit out the BAO features. Depending on 
the studied scenario, the lower limit was set between 0.02 and 
0.05 /i/Mpc and the order of the polynomial was either 6 or 7. 
The range for oscillation fitting was set to 0.05 <k < 0.2 /i/Mpc, 
somewhat narrower to avoid edge effects in the polynomial fit. 
We anyway do not want to include in this fit high k values for 
which non-linear effects are important. 

Ten Gaussian random field simulations were produced. Each 
of them was used ten times, with different random seeds to gen- 
erate the quasar positions. One hundred values of Ua were then 
obtained and their rms provides an estimate of the statistical pre- 
cision on the BAO scale. This was done for several different 
scenarios and the results are presented in Sect. 14.11 The mean 
valu e of the error on Ua returned by the Minuit fitting pack- 
age jJames & RoosI [l975l) is found to be compatible with the 
rms of kA, which confirms that our estimate of the error on P{k) 
and the fact that we neglected correlations are both reasonable. 

3.2.1 . Radial and transverse BAO scales 

In addition to the anisotropy of P(k) due to peculiar velocities, 
the reconstructed power spectrum, Pobs(^), involves a stronger 
anisotropy due to the sampling contribution in Eq. [3] which de- 
pends on ^11 . It is difficult to find a functional form to fit the 2D 
power spectrum P(k^,k\\). Instead P(k^,k\\) was divided by the 
polynomial obtained from the ID fit (previous section) to pro- 
vide a "reduced" power spectrum, which was then smoothed us- 
ing algorithms implemented in the CERN ROOT package. The 
ratio of the reduced-power-spectrum to the smoothed reduced- 
power-spectrum was fitted with 



1 -H A -i A: exp 



sm 




(7) 



The rms from the 100 resulting values of Ar^ and k^^ provides an 
estimate of the error on the transverse and radial BAO scales. 
Note however that this 2D fitting is quite delicate. It does not 
work in all cases and a more sophisticated procedure would be 
needed to analyze real data. 



4. Results 

4. 1 . Results with the true quasar unabsorbed spectrum 

In this section we discuss the performance of BAO reconstruc- 
tion in the ideal case where in Eq. |5] we divided by the true 
F(z) and c,(/i). By default the simulation was performed for 15 
QSO/deg^ in the redshift range 2.2 < z < 3.5 and with magni- 
tude 18 < mg < 22. The results were slightly scaled to corre- 
spond to the 10,000 deg^ of the BOSS survey. Different scenar- 
ios in terms e.g. of number of quasars per deg^ or noise level 
were studied. Table[T]presents, for each scenario, the rms in per- 
cent for the isotropic (kA), radial (fcy), and transverse (k±) BAO 
scales. To assess the significance of the BAO detection, we also 
give the difference of ;t'" relative to the case with no BAO {A -0 
inEq.|6]l. 

We start without noise and not including the effect of veloc- 
ities (Table [1] line 1). In this case, the transverse scale is much 
better reconstructed than the radial one because there are more 
transverse than radial modes (there are two transverse direc- 
tions and only one radial). Including peculiar velocities ampli- 
fies the power spectrum in the radial direction and dramatically 
improves the radial scale reconstruction, as can be seen on line 
2. 

The nominal case is obtained when we add noise according 
to BOSS setup, which results in rms of 2.28 ± 0.16, 6.79 + 0.48 
and 3.86 + 0.27 per-cent for the isotropic, transverse and radial 
BAO scales, respectively (line 4 of TablelTJ. On average over the 
100 simulations, Ax^ is 17.4, which means a significant detec- 
tion of BAO features. This is, however, just an average and the 
difference of ;^'^ ranges from 2 to 35, as illustrated by Fig.|9] At 
this point, we note that the (2Gpc//i)^ Horizon simulation has a 
17 times smaller angular coverage than our simulation. The x^ 
differences for Horizon simulation would therefore be of order 
unity for BOSS Ly-a survey, so, as announced in Sect. 12.11 the 
Horizon simulation is clearly not large enough to study the re- 
construction of the BAO features with BOSS survey. 

For the same nominal setup, an updated version 
(R McDonald, private c omm. ) of the analytic estimate of 
McDonald & Eisenstein) (l2007h gives an error of 1.8% on 
the isotropic BAO scale when weighting pixels according to 
signal-to-noise ratio. Without weighting the computed error 
increases to 1.91%. Our result, 2.28 ± 0.16, is larger by a factor 
1 . 19 ±0.08, which is consistent with the fact that we find a power 
spectrum about 10 % larger th a n pre dicted by formula |3] as 
illustrated in Fig|7] iGreig et aP (1201 ih very recently published 
simulation results; they get an error of 1.38% on the BAO 
scale for a fixed ratio S /N - 5, whereas we have in average 
S /N = 3.1. When we decrease the S /N ratio by a factor 1.7, 
we observe an increase of the rms by a factor 1.75, so it is not 
unlikely that an increase of S /N hy a factor 5/3.1 = 1.6, reduces 
the rms by a factor 2.28/1.38=1.65. 

Comparison of lines 3, 4 and 5 in Table[T|illustrates the vari- 
ations of the performance with the quasar density. The sampling 
and noise contributions to the power spectrum (Eq. [1]) scale as 
1/Nqso- So, if we could completely neglect Ppik) in Eq. [T] we 
would expect the rms of the BAO to scale as 1/Nqso- We ob- 
serve a decrease of the rms with the quasar density, which is not 
as strong as 1/Nqso but statistically compatible with the N^^^^ 
dependence found bv lGreig et al.l(l20I ll) . We investigated the de- 
pendence on the noise level. Line 6 shows that increasing the 
noise by a factor 1 .7 results in a significant degradation of the 
performance. As is clear from Fig.|7] increasing the noise power 
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Table 1. Rms of the BAO scale and significance of BAO feature 
detection, for different scenarios. 



Ax 

Fig. 9. Nominal simulation (15 QSO/deg^). Difference between 
the x^ for the nominal fit (Eq. |6]) and the x^ for no oscillations 
(A = in Eq.|6ll for the 100 realizations. 



QSO" 

(deg-2) 


Area" 
(deg2) 


Noise'' 


Velo" 


(%) 


kj 
(%) 


kif 
(%) 


Ax"- 


15 
15 


10,000 
10,000 






+ 


2.67 
1.60 


5.26 

5.77 


10.5 
2.84 


15.8 
34.2 


10 
15 
20 


10,000 
10,000 
10,000 


1 
1 
1 


+ 
+ 
+ 


3.13 
2.28 
2.20 


10.7 
6.79 
6.33 


5.27 
3.86 
3.20 


10.7 
17.4 
24.9 


15 
60 


10,000 
14,000 


1.7 
0.5 


+ 
+ 


3.99 
0.54 


10.8 
1.28 


6.00 
0.84 


12.4 
284 



Notes. *°* assumed number of QSO/deg^ **' survey size '^' scaling factor 
applied to the noise level, i.e. means no noise and 1 noise correspond- 
ing to BOSS nominal setup. *''' indicates whether the effect of peculiar 
velocities was taken into account or not. ''' rms for the isotropic BAO 
scale *'^' transverse BAO scale ''' radial BAO scale '''' significance of 
BAO feature detection 

With 100 simulations, the statistical error on the rms, is just 
rms/ V2OO = 0.071*rms, e.g. 0.16, 0.40 and 0.28% for the nominal 
case (line 4). 



spectrum by a factor 1 .7^ makes it the dominant contribution to 
the total power spectrum. 

We also considered the case of the BigBOSS project, assum- 
ing 60 QSO/deg^ up to a ^-magnitude of 23, over 14,000 deg^ 
(last line of Table. [T]|. With a 4 meter telescope and 5 exposures 
of 900 s instead of 2.5 m and 4 exposures, the noise is reduced 
by nearly a factor two relative to BOSS case. The resulting rms 
of the BAO scale is improved by a factor 4.2 relative to BOSS 
nominal case. 

Finally, we note that naively combining the rms on the trans- 
verse and radial scales in Table [1] results in an error which is 
significantly larger than the rms on the isotropic scale. This 
is (partly) due to a significant anticorrelation between the re- 
constructed transverse and radial scales. When the rms is quite 
small, e.g. in the nominal case with 20 QSO/deg^ or in the 
BigBOSS case, taking into account a correlation coefficient of 
typically -0.40, results in a combined rms in agreement with the 
rms on the isotropic case. In other cases, the combined rms is 
still larger by a factor on the order 1.2, up to a factor 1.4 for the 
case without velocities. This confirms that fitting the BAO fea- 
tures in two dimensions is difficult, in particular when the quasar 
density is low. 

4.2. Effect of unabsorbed spectrum error 

Using Eq.|5]to obtain 6f requires an estimate of F(z) x c,(/l) for 
each spectrum and the above results were obtained using the true 
value of this product. One can get a fair estimate of F(z) from 
the observed spectra, up to a normalization factor which is com- 
pletely degenerate with the normalization of the c,. On the other 
hand, with the resolution and the S /N ratio of the BOSS survey, 
the unabsorbed spectra, Ci(A), cannot be accurately determined 
from the observed spectra (see Fig. |5]). A possible approach is 
to fit a power law in A on the red side of the Ly-a emission 
line, to extrapolate it in the Ly-a forest, possibly multiplying 
it by some average shape of the unabsorbed spectru m as a func- 
tion o f Arf, the quasar rest-frame wavelength (e.g. ISlosar et al] 
l201lh . We could not do that because our mocks are based on 
PCA byiSuzuki et al. (2005) which extend only up to 1600 A in 
the quasar rest frame and therefore do not allow for a reliable 
power law fit. Reliable fits will be possible with the PCA pro- 



vided very recently bv lParis et al.l (1201 lb . extending up to 2000 
A. 

Instead, we computed the average spectrum in the forest, as 
a function of A,/. Then we divided each spectrum by this aver- 
age spectrum, fitted the result with a power law in A, and finally 
divided by this power law. This is dividing the mock spectrum 
by an estimate of F{z) x c,(/l). For the nominal setup, this results 
in an rms of (2.55 ± 0.16)% for the BAO scale, only a factor 
1.12 + 0.04 larger than what is obtained with the true unabsorbed 
spectrum. We also note that, while we did not observe any bias 
on the reconstructed BAO scale when using the true unabsorbed 
spectrum, with this approximate unabsorbed spectrum, there is a 
bias of -0.59 ±0.18%, which is, however, quite smaller than the 
2.55% statistical error on the BAO scale. In this procedure, since 
we do a power law fit along the quasar spectra, we remove large 
scale power in the radial direction, so all modes with low value 
of ^11 are strongly reduced and P{k) becomes very anisotropic. 
In this case, the smoothing used in the 2D fit procedure does not 
make sense. So we do not get results for the transverse and radial 
BAO scales, separately. A more sophisticated procedure would 
need to be developed. 

Another possible approach is to use PCA to predict the un- 
absorbed spectrum in the forest from the flux redward of the 
Ly-a emission line. However, we cannot use mock spectra gen- 
erated with PCA to study how precisely PCA reconstruct the 
unabsorbed spectrum. Instead we used 78 observed unabsorbed 
spectra, CohA^)^ an d their cor responding PCA estimates, cpcAi^) 
provided by Paris et alj (|2011). The observed unabsorbed spectra 
were manually estimated from high S /N SDSS II spectra, while 
the PCA unabsorbed spectra were obtained for each spectrum 
by a PCA analysis of the 77 other spectra. For each of our spec- 
tra, with true unabsorbed spectrum c,(/l), we randomly selected 
spectrum j within the 78 spectra from .Paris et al.. (2011.) . and 
used as an unabsorbed spectrum c,(/l) X Cp(^^(/l)/c^^^(A). Since 

there is much less uncertainty on F{z) we used the true value 
of this function. This results in an rms of (2.73 + 0.19)% for 
the BAO scale, i.e. a factor 1.20 + 0.11 larger than what is ob- 
tained when using the true unabsorbed spectrum, and the bias is 
+0.45 ± 0.2 0%. In addi t ion, m ost of the unabsorbed spectrum 
estimates of iParis et al.l (1201 lb are quite good while a few are 
grossly wrong. Therefore a significant improvement is to be ex- 



J.M. Le GofF et al.: simulations of quasar Ly-ff survey 




Fig. 10. Confidence level contours in the (wo,Wa) parameter 
plane for Planck + BOSS LRG + BOSS Ly-a. 



pected if a criteria can be found to identify a-priori the spectra for 
which PCA will fail so that a diff'erent approach can be used for 
these few spectra . Finally, note tha t the PCA unabsorbed spec- 
trum estimates of iParis et alj ( 1201 lb were obtained for high S /N 
spectra, but PCA estimates might not be very sensitive to addi- 
tional noise. 



5. Constraint on cosmological parameters 

The observation of the transverse and radial BAO scales provides 
a measurement of djiz)! s and dniz)! s, respectively, where dj{z) 
is the comoving angular distance, dniz) the Hubble distance and 
s the sonic horizon at decoupling. For a flat ACDM universe dj 
and dn read 



-£ 



(c/HQ)dz 
E(z) 



dH = 



E(z) 



(8) 



BOSS and Planck Fisher matrices allows us to compute the er- 
rors on dark energy parameters. If we define the factor of merit 
as the inverse of the 1-cr uncertainty ellipse in the (wq, Wg) plane, 
we get 34 for Planck and BOSS LRG survey. When we add 
BOSS Ly-a survey this increases to 48. The corresponding con- 
fidence level contours are plotted in Fig.fTOl 



6. Summary and conclusion 

In this paper we have investigated the possibility to constrain 
the BAO scale from the quasar Ly-a forest and in particular the 
precision that will be reached by a survey such as BOSS. To 
this aim, we have simulated realistic mock quasar spectra mim- 
icking the survey. The volume of the largest N-body simulations 
being too small for such a study, we resorted to Gaussian random 
fields, combined with lognormal approximation and FGPA. We 
investigate the effect of noise, peculiar velocity and random un- 
absorbed quasar spectra generated using a principal component 
analysis (PCA). 

The power spectrum of the transmitted flux fraction in the 
Ly-ff forest was thus computed and was either fitted in two di- 
mensions to reconstruct the radial and transverse BAO scales, or 
averaged over angles and fitted in one dimension to reconstruct 
the isotropic BAO scale. This was done over 100 realizations, 
resulting in an rms of 2.3% on the isotropic BAO scale, or 6.8% 
and 3.9% on the transverse and radial BAO scales determina- 
tions separately. This is compa tible with analytical estimates by 
Mc Donald & Eisensteiiil (l2007h . The BAO features are detected 
with an average Ax^ of 17 and the FOM for dark energy pa- 
rameter determination improves from 34 for Planck and BOSS 
LRG survey to 48 when the Ly-ff survey constraint is included. 
We note, however, that Ax^ varies significantly between realiza- 
tions, ranging from 2 to 35. 

These above estimates were obtained assuming a perfect 
knowledge of the quasar unabsorbed spectrum. Errors on the es- 
timate of the unabsorbed spectrum will increase the errors on the 
BAO scale by 10 to 20% and result in sub percent biases, overall 
quite small compared to the statistical error. Note that the effect 
of the presence of damped Ly-a and metal lines on the BAO 
measurement was not included in our mocks. 

Acknowledgements. We would like to thank Rupert Croft, Andreu Font-Ribera, 
Patrick McDonald and Anze Slosar for useful discussions and the last two for 
comments to the paper. 



where E(z) - V^a + ^m(1 + z)^. 

To estimate the sensitivity to parameters describing the dark 
energy eq uation of state, p = wp, we follow the procedure ex- 
plained in iBlake & Glazebrookl (l2003h . We introduce the z de- 
pendence of w as w(z) - Wo -\- Wa ■ z/il + z) and replace Oa in 
£(z)inEq.[8]by: 



f^A 



QaCxp 



r 1 + w(z') 

Jo 1+z' 



dz' 



(9) 



Using the relative errors on the transverse (6.79%) and radial 
(3.86%) BAO scales, obtained for the nominal setup (Table [1] 
line 4), and taking into account the anticorrelation, we can com- 
pute the Fisher matrix for the five cosmological parameters Q.,„, 
Sib, h, Wq and Wa- We also use the Fisher matrix for Plan ck mis- 
sion computed for the Euclid proposal (Laureiis /009) which 
assumes a flat universe and involves the 8 parameters: 0,„, Q/,, 
h, Wq, Wa, cr%, Hs (spectral index of the primordial power spec- 
trum) and T (optical depth to the last-scatter surface). Combining 
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